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Abstract — Atta sexdens rubropilosa is an important leaf- 
cutting ant species considered as a pest in agricultural 
crop or reforestation areas. Quantitative real-time 
reverse transcriptase-polymerase chain reaction (RT- 
qPCR) is a technique that can help us to understand the 
regulation and the function of a gene. However, its 
reliability depend on the data normalization. Different 
normalization strategies can be adopted for qPCR, 
reference genes has been cited as one of the most effective 
methods. It has not been identified a universal reference 
for all organism and experiment. In this way, the 
validation of reference gene is crucial step. This is the 
first study to evaluate reference genes for leaf-cutting 
ants. To this, we analyzed the expression levels of 
candidate reference genes (act, efl -alpha, efl-beta, 
GAPDH and rpI18) in different developmental stages 
(larva, pupa and worker ) and tissues (head, mesosoma 
and worker without gaster) of A. sexdens rubropilosa. 
Four different algorithms (BestKeeper, geNorm, 
NormFinder and comparative ACt method) were used in 
statistical analysis of the stability of the genes and 
RefFinder was used to propose a consensus list for 
ranking the reference genes. Our results showed that the 
most suitable combinations of reference gene candidates 
were rpll8 and efl -alpha for the different developmental 
stages and rpll8 and efl-beta for the different tissues. In 
this work, we also report the obtaining from a putative 
acetylcholinesterase from A. sexdens rubropilosa 
( GenBank KY464935), which was used as a target gene to 
confirm the reliability of reference genes suggested. 
Keywords — Acetylcholinesterase, Atta sexdens 
rubropilosa, Developmental stages, Reference gene, 
RT-qPCR, Tissues. 

I. INTRODUCTION 

Insects are the dominant animals in most terrestrial 
ecosystems, both in number of species and biomass. 
Among them, social insects present colonies with large 
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numbers of individuals and, consequently, greater 
biomass [1], The diversity of ant species indicates that 
they rank among the most successful insects. It is 
estimated that 40.000 ant species exist in the world, of 
which about 16.000 species and subspecies have already 
been formally described [2]. All ants are considered 
eusocial. The ecological significance of ants is 
indisputable; however, as mankind changes the 
environment for agricultural or forestry development or 
for the construction of cities, the environment becomes 
less complex and there is a decrease in biodiversity; 
although, on the other hand, opportunistic animals 
(generalists) are favored [3], Among them are some 
species of ants, which increase in population density and 
can adversely affect human activities [4, 5]. 

Although only few ant species are considered pests (less 
than 1% of the known species), the economic losses 
caused by them can be large, especially considering those 
that occur in silviculture and agriculture, both in the 
production and storage of food [6], Among the 
economically important ant species in Brazil, leaf-cutting 
ants stand out. They are distributed throughout the 
Americas and cause major damage, particularly in South 
America [4, 5]. Leaf-cutting ants are the main herbivores 
present in the Neotropics and are also considered as pests 
in agricultural crop or reforestation areas [7], 

In addition to the losses that leaf-cutting ants cause to 
agriculture, silviculture, and pastures, there are the 
environmental problems and poisoning of other animals, 
including humans, caused by excessive use of pesticides 
in the attempt to control these ants. One approach for the 
development of new ways to control this problem, while 
minimizing the damage to the environment, is causing the 
silencing of a specific gene. 

Reverse transcription - quantitative real-time polymerase 
chain reaction (RT-qPCR) is a technique that can help us 
to understand the regulation and the function of a gene. 
Different normalization strategies can be adopted for 
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qPCR; however, the use of a reference gene has been 
cited as one the most effective methods, since the 
reference gene undergoes the same steps as the target 
gene, correcting errors and differences in the sample [8, 
9], Several works have demonstrated the importance of 
choosing a proper reference gene and impact of using 
such not appropriated gene. Incorrect results might be 
obtained due to misinterpretation of RNA transcription 
levels especially for low abundance gene transcripts [10, 
11]. To date, several works were developed to determine 
the reference gene in Insecta, which demonstrate that is 
impossible to find a universal reference gene able to 
covering all organism and conditions [11, 12, 13, 14, 15, 
16, 17], 

The choice of a gene as reference gene is not trivial and 
starts with the selection of candidate reference genes to be 
analyzed [18]. Housekeeping genes (HKG) are usually 
first selected to be investigated as reference genes due to 
the assumption that they are involved in essential 
processes for the survival of cells and are expected to be 
expressed in a stable and nonregulated level [19]. A 
reliable reference gene should exhibit an expression level 
not affected by experimental factors, with minimal 
variability between tissues and physiological states and a 
Ct (Cycle Threshold) similar to the target gene [9]. The 
most studied reference genes, GAPDH and 18s rRNA, are 
not always expressed in a constant manner. In addition, 
their expression can be altered depending on the 
organisms and their life stages [9, 12]. 

A good strategy for selecting potential candidate 
reference genes is based on previous data from species 
relative to the studied specie due to the high degree of 
similarity between genomes and the expectation of a 
similar expression level [9]. However, differences in 
stability have been verified in the analysis of a reference 
gene in Insecta for organisms from the same order [11, 
12, 14, 15], family [20, 21] and even for those of the same 
genus [14, 16, 21], This justifies studies to validate 
reference genes for an organism and experimental 
conditions before the analysis for precise mRNA 
quantification [22]. 

We believe that this is the first study to evaluate reference 
genes for leaf-cutting ants. Genome from Acromyrmex 
echinatior [23] and Atta cephalotes [24], both leaf-cutting 
ants, are available in the database but there is no validated 
gene(s) for leaf-cutting ants. In the present study, seven 
candidate reference genes from A. sexdens rubropilosa 
were selected: actin {act), elongation factor 1 -alpha (efl- 
alpha), elongation factor 1-beta ( efl-beta ), 
glyceraldehyde-3-phosphate dehydrogenase (GAPDH), 
ribosomal protein L18 ( rpll8 ), TATA box binding protein 
(tbp), and 18S ribosomal RNA (18S rRNA). The 
expression level and stability of them, except tbp and 18S 
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rRNA were examined. The stability of these candidates 
was investigated in three A. sexdens rubropilosa 
developmental stages (larva, pupa and worker) and in the 
following parts of the insect: head, mesosoma, gaster, and 
worker without gaster. 

To validate the results the expression profile of a putative 
target gene, acetylcholinesterase from A. sexdens 
rubropilosa, was investigated. Acetylcholinesterase 
(AChE, EC 3. 1.1. 7) is a serino hydrolase that hydrolyzes 
and inactivates the neurotransmitter acetylcholine 
controlling the cholinergic signal transmission in the 
synapse [25], The evaluation with the target gene 
emphasize the importance to validate the reference gene 
as internal control in genomic research and the results 
presented will be useful for further works in this field for 
leaf-cutting ants. 

II. MATERIAL AND METHODS 

2.1 Biological samples 

The A. sexdens rubropilosa Forel (Hymenoptera: 
Formicidae) was collected from laboratory nest localized 
in the Center of Studies on Social Insects (UNESP, Rio 
Claro, Brazil). The nest was supplied daily with leaves of 
Eucalyptus alba, oat seeds and occasionally with the 
leaves of other plants such as Hibiscus sp., Ligustrum sp. 
or rosebush petals. 

Developmental stages samples were picked from the nest: 
10 larvae, 10 pupae and 10 workers were collected for 
each replicate, washed with RNase-free phosphate- 
buffered saline (PBS) and stored at -80 °C until used. 
Tissue samples were dissected from workers: 10 heads, 
10 mesosomata, 10 gasters and 10 workers without gaster 
for each replicate, followed by wash with PBS and stored 
at -80 °C until RNA extraction. All samples were 
collected in triplicate (biological triplicate). 

2.2 RNA extraction and cDNA synthesis 

Total RNA from larvae, pupae and workers without 
gaster was extracted using a combined method with 
TRIzol® (Thermo Fisher Scientific) and PureLink® RNA 
mini Kit (Thermo Fisher Scientific). For tissue samples 
from workers, head, mesosoma and gaster, only the 
PureFink® kit was used. The manufacturer’s protocol 
was followed for both applications. Total RNA from each 
sample was diluted in 10 mM Tris-HCl pH 7.5 and the 
quantity and quality of the samples were determined by 
the 260/280nm and 260/230nm ratio using a BioSpec- 
nano (Shimadzu-biotech). The RNA integrity was 
analyzed by agarose denaturing gel 1.2 % (w/v) and 
confirmed by the intense ribosomal RNA bands and the 
absence of smears. The total RNA was treated with 
DNase (DNasel, RNase-free -Thermo Fisher Scientific) 
to eliminate potential genomic DNA contamination. 
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First-strand complementary DNA (cDNA) was 
synthetized using 1.35 pg of total RNA with 
Superscript® VILO Master Mix (Thermo Fisher 
Scientific) with 20 pi final reaction volume, following 
instructions from the manufacturer. The synthesis of 
cDNA was performed in triplicate for each sample 
(replicate) and the product was stored at -20°C for later 
use. 

2.3 Selection and procedure for obtaining the sequence 
of candidate reference genes and the putative 
acetylcholinesterase gene 

Seven genes were selected as candidate to reference 
genes: actin {act), elongation factor 1-beta ( efl-beta ), 
elongation factor 1 -alpha (e/7 -alpha), glyceraldehyde-3- 
phosphate dehydrogenase ( GAPDH ), ribosomal protein 
L18 ( rpl!8 ), TATA box binding protein (tbp), and 18S 
ribosomal RNA (18S rRNA). Acetylcholinesterase 
(AChE) was used as a target gene. 

A. sexdens rubropilosa genome is not yet known, for this 
reason, the sequences of most of these genes are not yet 
deposited. Sequence alignments for every gene from 
several species of ants were performed and conserved 
sequence regions were used to design specific and/or 
degenerated forward and reverse primers (Table 1). 

The DNAs were amplified by PCR performed with 1 pL 
cDNA; 1 pM for the specific primer and 2 pM for the 
degenerated one; 0.2 mM of dNTPs and 1.25 U of Pfu 
DNA Polymerase (Thermo Fisher Scientific) in 25 pL 
final volume. PCR amplification was performed using the 
following program: 3 min at 95 °C followed by 40 cycles 
of 1 min at 95 °C, 90 s at 52 or 62 °C, 6 min at 72 °C and 
a final extension step of 10 min at 72 °C. The 
amplification products were evaluated on 1% agarose gel 
and the bands were extracted and purified. Samples were 
quantified by absorbance in 260 nm and then submitted 
for sequencing analysis (ABI 3730 DNA Analyser - 
Thermo Fisher Scientific) with the same primers used for 
amplification. The sequences were analyzed with BioEdit 
( v7 . 2 .5 -http ://w w w . mbio . nc su.edu/bioedit/page2.html) 
and the search for similarity was carried out using the 
BLAST tool. The amplicons were compared with ants’ 
sequences and led to an identity of over 90 %. 
Acetylcholinesterase sequence from Acromyrmex 
echinatior (GenBank GL888 116.1) was used to design 
primers with the inclusion of site for restriction enzymes 
and exclusion of signal peptide (Table 1). The PCR 
reaction was performed similar as described above with 
0.2 pM of each primer, an annealing temperature of 63°C 
in 30 cycles with an extension time of 3 min. The reaction 
product was analyzed on 1% agarose gel, purified and 
sequenced. 


ISSN: 2456-1878 

2.4 Primer design for quantitative real-time RT-PCR 
(RT-qPCR) 

Using the sequence from the amplicons, new primer pairs 
were designed, for each gene, using Primer Express® 
Software Version 3.0 and selecting the amplicon length 
between 50 and 150 pb. Among the various possibilities 
of primers provided by the software output, the selection 
of the primer pair was based on the low score penalty and 
smaller size of the amplicon. Primer sequences and 
amplicon characteristics are summarized in Table 2 for 
each candidate gene and for the AChE gene. 

2.5 Quantitative Real-time PCR (qPCR) 

The minimal primer concentration was determined using 
two-by-two combinations of forward and reverse primers 
in 100, 150 and 300 nM, in duplicate, and a non-template 
control for each combination. RT-qPCR was performed in 
an Applied Biosystem StepOnePlus™ system (Thermo 
Fisher Scientific) with a total reaction volume of 12 pL, 
containing 6 pL Power SYBR® Green PCR Master 
Mix (Thermo Fisher Scientific), 3 pL of forward and 
reverse primers in the appropriate concentration to give 
the relation described above and 3 pL of cDNA 
previously 30-fold diluted. Cycling conditions were: 10 
min at 95 °C (polymerase activation) followed by 40 
cycles at 95 °C during 10 s (denaturation) and 60 °C 
during 1 min (annealing/extension). For each reaction, the 
dissociation of the PCR products (melting curve) was 
analyzed from 60 to 95 °C to ensure the specificity of the 
amplified product. 

The appropriate primer concentration was used to 
determine the RT-qPCR efficiency by a relative standard 
curve for each candidate reference genes. For this 
purpose, a 5 -fold serial dilution of the cDNA was used as 
a template molecule for candidate reference genes and a 
2-fold serial dilution for acetylcholinesterase. Samples 
were analyzed in triplicate plus a negative control. RT- 
qPCR efficiency was calculated according to the equation 
1, in which the slope comes from the plot of Ct values 
against the logarithm of cDNA concentration [10], 
Efficiencies between 90% and 110% were used for 
further statistical analysis (Table 2). 

E = (10~ 1/slope - 1) xlOO (1) 

Once the optimum primers and cDNA concentrations 
were determined for each gene, the gene expression 
analysis was performed by RT-qPCR in the conditions 
already described above using 3 pL of cDNA diluted 60 
times. The reaction was performed in triplicate and with a 
non-template control for each conversion reaction of 
cDNA. 

2.6 Data Analysis and Statistics 
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2.6.1 Expression level analysis 

The expression level of candidates for reference gene was 
analyzed by standard deviation, coefficient variation and 
Student’s f-test. The t-test was used to verify if the mean 
value of expression levels between two different stages of 
development are statistically different or not. The same 
procedure was adopted to analyze if there is significant 
difference among the expression levels between different 
parts of the ant body. For this comparison, the first 
procedure is to calculate the pooled estimate of standard 
deviation (2), followed by the calculation of the 
experimental t-value (3), where S, n and x are the standard 
deviation, degrees of freedom and means, respectively, 
for the two analyzed genes. If the experimental t-value is 
lower than the critical t-value then there is no significant 
difference between the mean values of the gene 
expression at a 95% of confidence level [26], 

' r V *,+ h } -2 ' I + 

( 2 ) 

, .jfLlfi! I "i"i . . 1*1 ~**| I 

S, s , V //, + n. 

(3) 

In addition, the source of the standard deviation was 
compared in the analyses by comparing the variability of 
data from replicates with data related to the different body 
parts or different life cycle stages. 

The autoscaling preprocess also was used in order to 
obtain a better visualization of the most similar variables. 
This strategy is well known in chemometrics to normalize 
all the variables (expression levels, in this case) in order 
to minimize the differences in the intensity among them 
[27], This preprocess is performed by subtracting the 
mean value of the genes expression in each 
developmental stage from the total mean of the same gene 
for all developmental stages, followed by division by the 
standard deviation of the gene for all developmental 
stages. 

All of the tests were performed at 95% confidence level 
using the Microsoft Excel® software. 

The relative expression level of the target gene was 
obtained according to the relative quantification by 2" AACt 
method [28]. For developmental stage larva was used as 
calibrator and mesosoma for tissue. The data was plot as 
mean + SEM and the analysis with 95% confidence using 
GraphPad Prism 5. 

2.6.2 Selection of reference genes 

The selection of the reference gene was performed using 
four algorithms, BestKeeper® version 1 [29], geNorm 
version 3 [30], NormFinder version v0.953 [18] and the 
comparative ACt method [31], RefFinder was used to 
compare and rank the reference gene candidates [32], In 
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addition, the results from these software were compared 
with the statistical analysis performed. 

2.6.3 BestKeeper 

BestKeeper is an Excel based spreadsheet software that 
uses raw data (Ct values) and reaction efficiency (E) to 
identify the best-suited standards and combines them into 
an index [29]. The output Table shows descriptive 
statistics for each reference gene candidate: the geometric 
mean (geo Mean), arithmetic mean (ar Mean), minimal 
(min) and maximal (max) value, standard deviation (SD), 
and coefficient of variation (CV). The x-fold over- or 
under-expression of individual samples are calculated 
based on the geometric mean. These results are corrected 
via RT-qPCR efficiency to exhibit minimal and maximal 
values considering the x-fold ratio and their SD (SD [± x- 
fold]). The stability of the reference gene candidate can 
be evaluated by the user considering the calculated 
variation, such as SD and CV. Reference genes can be 
ordered from the most stable (lowest variation) to the 
least stable (highest variation). Candidate genes with SD 
[± Ct] higher than 1 (= starting template variation by a 
factor of 2) can be considered inconsistent and it is 
recommended to exclude them from the calculation index 
[29], 

BestKeeper also tests individual samples for their 
integrity. To do this, x-fold values are used through an 
intrinsic variation (InVar) for a single sample. It has been 
suggested that samples with 3-fold over- or under- 
expression should be removed from the analysis due to 
high deviation that can be attributed to inefficient sample 
preparation, incomplete reverse transcription or sample 
degradation [29]. 

2.6.4 geNorm 

geNorm uses relative quantification data from raw Ct 
values by 2 _ACt . This software determines the expression 
stability of candidate reference genes based on the gene- 
stability measure [30]. The internal control gene stability 
measure M is defined as the average pairwise variation 
for that gene with all other tested reference genes, where 
the lowest value for M corresponds to the most stable 
candidate, and the highest corresponds to the least stable. 
Values that surpass the cutoff value of 1.5 are not 
considered stable. The program enables stepwise 
exclusion of the gene with the highest value of M and 
recalculation of M for the remaining genes ranking them 
according their expression stability. 

The second important parameter calculated by geNorm is 
the pairwise variation (V n /V n +i) between two sequential 
normalization factors (NF„ and NF n+ i) to obtain the 
minimal number of reference genes [30]. The cutoff value 
of 0.15 indicates that no additional gene, beyond the n 
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most stable genes, needs to be included for a reliable 
analysis. 

2.6.5 NormFinder 

NormFinder, a model-based approach for the estimation 
of expression variation. It is able to identify stably 
expressed genes in a set of reference gene candidates. The 
Ct values were transformed to a linear scale by the same 
method used for geNorm in order to prepare input data. 
The mathematical model of gene expression presented on 
a Visual Basic application for Microsoft Excel estimates 
the intragroup variation as well as the intergroup variation 
in all groups [18]. These variations are combined into a 
stability value, representing a practical measure of the 
systematic error that will be introduced when using the 
investigated gene. The software requirements are 8 
samples/groups and at least 3 candidate reference genes; 
5-10 candidates are recommended in order to obtain 
reliable results. The best reference gene candidates are 
ranked in an index that is based on stability values; a low 
stability value indicates the most stably expressed gene. 


2.6.6 Comparative ACt method 

This method compares the relative expression of “pairs of 
genes” within each sample to identify a useful reference 
gene [31]. The variation ACt for each two genes is 
obtained by the difference of Ct values. The mean, 
standard deviation and mean of the standard deviation 
related to the ACt are obtained and used to rank genes. 
Two genes are stably expressed or co-regulated if a 
constant ACt value is observed between two genes. A low 
deviation value shows a more stable expression due to a 
short variability. 

2.6.7 RefFinder 

RefFinder is a web-based tool 

(http://fulxie.Ofees.us/?type=reference) that considers the 
four algorithms described before to rank the candidate 
reference genes. It uses raw Ct as input data to obtain the 
rank provided by each program. Then, based on the ranks, 
a weight for each individual gene is calculated to obtain 
the final overall rank [32], 


Table.l: Primer pairs sequence to identify the sequence of candidate reference genes. 





Amplicon size 

Genes 

Function 

Primer sequence 3 (5 ’-3’) 

sequenced 




(bp) 

act 

Cytoskeletal structural protein 
involved in cell motility, structure and 
integrity 

F: GY GACGACGAMGTAGC 

R: TGCCAGATCTTCTCC 

259 

ef 1-alpha 

Elongation during polypeptide 
synthesis in the ribosome 

F: GACATTGCCTTGTGGAAG 

R: CAGTTGGCCTGGTAGGTGGC 

498 

ef 1-beta 

Elongation during polypeptide 
synthesis in the ribosome 

F: GTGGCAACCAACTCAGG 

R: GTGGACGAAGCTGGG 

177 

GAPDH 

Carbohydrate metabolism 

F: CAACTTYGARRTYSTCGAGG 

R: CCRWAYTCGTTGTCATACC 

436 

rpll8 

Encode a ribosomal protein that is a 

F: 

CG AT ATT A AT C ATA AGC ATG ATCG 

481 


component of the 60S subunit 

GA 

R: CTTATAACCGCAGCTGCGTC 


tbp 

Coordinate the initiation of 
transcription by RNA polymerase II 
promoter 

F: ATGGATCAGATGCTTCCG 

R: AGACCTGGAAATAGCTCTGG 

677 

18S rRNA 

Structural RNA constituent of subunit 

F: AGCCATGCATGTCTCAGTGC 

648 

40S of the ribosome 

R: CGCGACGGGATATTAGTTGG 


a F and R indicate forward and reverse primers, respectively. 


Table 2: Primer sequences and amplicon characteristics for reference gene candidates used in RT-qPCR analysis. 


Gene 

Sequence (5’-3’) 

Product Lenght 
(bp) 

Efficiency 

(%) b 

R 2c 

act 

F: TCCTCGCGCCGTCTTTC 

R: TTG ACCC AT ACCG ACC ATC A 

69 

98.2 

0.990 
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F: AGCCGCTGTTGCATTCGT 


ef 1-alpha 

R: 

TGACGGATACTTCCAACATATTGTC 

64 

95.1 

0.993 

efl -beta 

F: GGCAACCAACTCAGGCTGAT 

R: CAACGGAGTACATGAGGATTCG 

82 

99.8 

0.900 

GAPDH 

F: ATG ACG ACTGT AC ATGCG ATT AC A 

R: TCACGCCATAGCTTGCTTGA 

70 

97.4 

0.990 

rpll8 

F: CGAGATCATCACGTTCGATC 

R: CTGCATCAAGACTGTACGTTTTCC 

66 

97.9 

0.988 

tbp 

F: CAGCAGTCACAACAATTTCAACAA 

R: TCATTAGCATGCCACTCTGCAT 

75 

* 


18S rRNA 

F: CTGATCGCACGGTCTTAGCA 

R: CAGAACCTACCATCGACAGTTGAT 

73 

* 



a F and R indicate forward and reverse primers, respectively 
b RT-qPCR efficiency, calculated by the standard curve method 
c Determination coefficient 
‘Results will be discussed in item 3.3 


III. RESULTS 


3.1 Sample quality 

Despite the accurate validation of reference genes, several 
problems can directly influence the results during the 
sample processing and preparation. In general, these 
problems can be associated to factors such as sample 
storage, RNA extraction and quality, synthesis of cDNA 
with transcriptase reverse, primer design and 
normalization [33]. Agarose gel electrophoresis was used 
to confirm the integrity of the RNA extracted from A. 
sexdens rubropilosa (Fig. 1). As previously described for 
insects [34], only one intense RNA band can be seen in 
the denaturing gel, which corresponds to the two 
fragments of the 28S rRNA that co-migrate with 18S 
rRNA. 

The RNA extracted from worker and from gaster using 
the Trizol method was degraded. Valles and collaborators 
detected the presence of an endogenous component 
located in the abdomen of adult ants (terminal abdominal 
segments) from Nylanderia pubens Forel (Hymenoptera: 
Formicidae), and also in queens and alate ants, capable of 
degrading RNA [35]. This report has also showed that the 
addition of at least 50 mM EDTA leads to intact RNA. 
However, EDTA can inhibit subsequent transcription and 
the PCR reaction, which could include one more variable 
in RT-qPCR experiments [36], Therefore, new RNA 
extraction was carried out with the PureLink® RNA mini 
Kit producing intact RNA from the worker (Fig. 1, lane 3) 
and partially intact from the worker’s gaster (Fig. 1, lane 
6). RNA extracted by combining Trizol with the kit (from 
lavae, pupae and workers without gaster) (Fig. 1, lanes 1, 
2, and 7, respectively) and only with the kit (head and 
mesosoma) (Fig. 1, lanes 4 and 5) showed characteristic 
bands of intact RNA. 


There are divergent discussions about the influence of 
RNA integrity on RT-qPCR experiments. Some authors 
[36] suggest that RNA degradation can be tolerated since 
an amplicon with 70-250 bp is obtained, while other 
authors indicate that partially degraded RNA can give an 
imprecise result of genic expression [37]. Because of this, 
the RNA from the ant’s gaster was excluded from the 
analysis with exception of the BestKeeper algorithm that 
also analyze the sample integrity. 

3.2 Selection and procedure for obtaining the sequence 
of reference gene candidates and AChE gene 

The lack of genome information for A. sexdens 
rubropilosa was not an obstacle for gene validation: the 
sequences alignment of other ant nucleotides and the 
analysis of the conserved regions enabled the design of 
primers, which were used for obtaining amplicons from 
A. sexdens rubropilosa cDNA (Table 1). The choice of a 
reference candidate for analysis was made based on the 
reference gene for Solenopsis invicta, the closest insect 
(Formicidae) with described reference genes [12] and 
other insects [9, 13, 15, 21, 38, 39, 40], 

All seven candidate reference genes {act, efl -beta, efl- 
alpha, GAPDH, rp!18, tbp and 18S rRNA) were amplified 
by PCR using these primer pairs. The amplicons were 
sequenced and these sequences were used in a sequence 
similarity search, confirming the identity of the genes. 
The AChE sequence from A. sexdens rubropilosa without 
signal peptide can be accessed in GenBank KY464935. 
The DNA sequence amplified from the candidate 
reference genes and AChE was used to design specific 
primers for the RT-qPCR (Table 2). 
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3.3 Standardization of the conditions for Quantitative 
Real-time PCR (qPCR) 

The minimum primer concentration for each target gene 
was determined to minimize non-specific amplifications 
and to reach the maximum amplification efficiency [41], 
The proper combinations of primers were considered 
those that introduced the melting curve with a single 
peak, resulting in amplification reactions with lower Ct 
values and greater ARn. All samples showed a single 
peak in the melting curve. However, it was also observed 
one peak in the melting curve of the non-template control 
for the tbp gene suggesting the formation of a primer 
dimer. The primers pairs designed for the tbp gene could 
provide unreliable results and, therefore, this gene was 
excluded from this study. Nevertheless, analysis would be 
possible through the design of new primers for this gene. 
The best primer concentrations determined for the 
remaining six genes were used in the determination of the 
RT-qPCR efficiency for each gene. The 18S rRNA 
showed a high abundance of transcripts due to the low 
value of Ct (data not shown). The sample (cDNA) was 
diluted by a factor of 60 to verify the reaction efficiency, 
but the results were not satisfactory. The discrepancy 
between rRNA and mRNA has been discussed as a 
negative point in the use of rRNA in reference genes 
studies [30]; in addition, the necessity of high sample 
dilution prior to qPCR can lead on dilution errors [42], 
For these reasons, studies that analyzed this gene as a 
reference gene in insects also suggested the elimination of 
18S rRNA from the list of consensus genes [17, 21], 
Therefore, 18S rRNA gene was excluded from this study. 
RT-qPCR efficiency for act , e/7 -alpha, efl-beta, GAPDH, 
rpll8 and AChE was between 95.1 - 103.6 %, showing 
that they can be used for RT-qPCR analysis (Table 2). 
The relative expression level of the target was obtained 
by 2" AACt method, to this the target and reference should 
have amplification efficiencies approximated equal, the 
observation of how ACt varies with template dilution 
showed that the method can be used for analysis (data not 
shown) [28]. 

3.4 Statistical analysis 

3.4.1 Transcription profile of candidate reference 
genes 

Fig. 2a shows the genes plotted as function of their gene 
expression average at different developmental stages. The 
autoscaling preprocess was used in order to obtain a 
better view of the correlation among the variables. Fig. 2a 
shows that the most correlated genes are efl -alpha and 
efl-beta, followed by rpll8, being the genes act and 
GAPDH more intercorrelated. Therefore, the variables 
efl -alpha, efl-beta, and rpll8, are the most correlated 
variables and they present the low variability with the 


development stage, making these variables good 
candidates for reference genes. 

Fig. 2b shows the genes plotted as function of their gene 
expression average in different parts of the body using 
autoscaling, as explained before. As showed in this Fig., 
the most correlated genes with respect to different body 
parts are rpll8 and efl-beta, followed by GAPDH and 
efl -alpha. In this case, although the genes GAPDH and 
act present the best SD and CV values (data not shown), 
they do not present good correlation compared to the 
other variables. Then, the best choice for a reference gene 
will depend if the algorithm used seeks lower SD and CV 
values or the two most correlated variables. 

The Student’s t-test showed no significant difference for 
all genes, with a confidence level of 95%, when larvae 
and pupae were compared. However, there were 
significant differences in the expression of the genes 
when larvae and pupae were compared to workers, efl - 
alpha, efl-beta and rpll8 presented the most constant 
expression with the development stage. Similar results 
were obtained for the different tissues, where there were 
not significant differences for all genes expression levels 
when head and torax were compared. In addition, act was 
the only gene that didn’t present significant differences 
comparing any tissue by t-test at 95% of confidence. 
Again, the best selected gene will depend of the algorithm 
used for the genes evaluation. 

The comparison between the standard deviation for 
replicates from RT-qPCR experiment and replicates of 
converting RNA into cDNA showed that the deviation of 
the last one is 1.6 and 3.6 times higher than the first. This 
was expected since it is well known that the conversion of 
RNA into cDNA is the main source of data variability. 


MI2MJM 4 5 6 7 M 
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Fig. 1: Analyzes of RNA integrity by denaturing agarose gel 
electrophoresis 1.2% ( w/v). (M) molecular marker; RNA 
extracted from: 1) larvae; 2) pupae, 3) workers; 4) head; 5) 
mesosoma; 6) gaster; 7) worker without g aster. 
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Fig. 2: Averages of gene expression as function of the 
developmental stage (a) and tissue (b)for each candidate. The 
data was auto-scaled before the plot without gaster. 

3.4.2 BestKeeper 

Using the output values for Ct variation (SD [+ Ct]), the 
expression level of the candidates reference genes was 
analyzed and the ranking was constructed. As the SD [+ 
Ct] are below 1 for all of the five candidates from the 
developmental stages (larva, pupa and worker), it means 
that they can be considered stably expressed (Table 3). 
The gene stability in decreasing order for developmental 
stages is: GAPDH, rpl!8, efl -alpha, efl-beta, and act. 
The intrinsic variance (InVar) of expression for a single 
sample is below 3 and the highest value obtained was 
1.03. This result confirms the integrity of total RNA 
extracted from specimen of every developmental stage. In 
addition, the results agree with the statistical analysis 
where was verified that GAPDH present the lowest 
standard deviation. 

The tissue samples (head, mesosoma, and worker without 
gaster) including gaster the Ct variation (SD [+ Ct]) and 
up/downregulation (SD [+ x-fold]) showed higher values 
than 1 and 2, respectively. In this way, none of the 
candidate reference genes from the tissues could be used. 
The InVar (+ x-fold] values for the samples were higher 
than 3 for the gaster confirming RNA degradation as 
observed on the denaturing agarose gel (Fig. 1) and 
justifying the exclusion of this tissue from analysis. The 
gaster samples were eliminated and the data were 
analyzed again; this has led to acceptable values of SD [+ 
Ct], SD [+ x-fold] and InVar [+ x-fold] for all reference 
gene candidates. This proceeding also was adopted by 
Ponton and collaborators that also identified InVar [+ x- 
fold] > 3 when analyzing different treatments together 
from Drosophila melanogaster [17]. To overcome this, 
the authors suggested a separate analysis of the samples 
with different treatments. 

The stability of the genes obtained by BestKeeper, in 
decreasing order for tissues (head, mesosoma, and worker 
without gaster), was GAPDH, act, rpll8, efl -1 beta and 
efl -alpha (Table 3). 


3.4.3 geNorm 

Vandesompele and collaborators described this robust and 
innovative strategy to identify the most stably expressed 
control genes in a given set of tissues, and to determine 
the minimum number of genes required to calculate a 
reliable normalization factor [30]. These authors also 
suggested the use of at least three reference genes to 
increase the confidence of the analysis when the 
suggested number of genes is too high or the sample 
limited. 

First geNorm calculates the gene stability measure (M); 
the genes presenting M < 1.5 are considered stable. Here, 
the five candidate reference genes, considering both the 
development stages and different tissues, could be 
considered for use as reference genes. 

The candidate reference genes were ranked after stepwise 
exclusion of the highest M value (Table 3, Fig. 3), which 
results in a combination of two constitutively expressed 
genes that exhibit the most stable expression in the tested 
samples. The decrease of the M value during this analysis 
reflects the differences in the stability of reference gene 
candidates associated with the highest stability of the 
remaining genes. In this way, it is clear that act and 
GAPDH present an unstable expression, represented by 
the decrease of the M value after removal of these genes 
(Fig. 3) in both groups of analyzed samples (development 
stages and tissues). 

Therefore, the most stable genes are efl -alpha and efl- 
beta for the different developmental stages and efl-beta 
and rpll8 for the different tissues. These results perfectly 
agree with the spatial representation of the gene 
expression levels presents in Fig. 2. 

Vandesompele and collaborators also demonstrated the 
large errors associated with the use of a single gene as 
reference gene [30]. To obtain reliable results for gene 
expression analysis, geNorm determines the minimum 
number of genes to be used as reference genes in a 
particular experiment. To do that, the pairwise variation 
was individually determined for each gene starting with 
the two most stable genes (n=2) with the sequential 
addition of the other least stable genes (Vn/ n +i). The 
optimum number of reference genes was determined by 
the levels of variation in the average reference gene 
stability. V 2/3 values are below the threshold value of 0.15 
(Fig. 4). Then, geNorm tool indicates that the use of only 
two genes, the most stable ones, is sufficient to obtain 
accurate results for normalization experiments in RT- 
qPCR analysis from the different developmental stages 
and tissues for A. sexdens rubropilosa. 

3.4.4 NormFinder 

The mathematical model of NormFinder considers the 
inter- and intra-group variation to estimate the gene 
stability and rank genes with minimal variation. 
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eliminating problems associated with the selection of co- 
regulated genes [18]. For the different developmental 
stages, the most stably expressed candidate gene was 
rpl!8 with the lowest variability value. The decreasing 
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order of gene stability is rpI18, efl -alpha, efl-beta, 
GAPDH and act (Table 3). For different tissues, the 
decreasing order was rpI18, efl-beta, GAPDH, efl -alpha 
and act. 



Fig-3: Gene expression analysis by geNorm. Expression stability and ranking of 5 candidate reference genes. The M value { indicates the 
average expression stability) is lower for the most stable expression. A) Developmental stages: B ) Tissues 


6 30-r 



Dc«'#*oom*nt* Tissue 


Fig. 4: Paim’ise variation of candidate reference genes for 
determination of the optimal number of control genes for 
accurate normalization. Pairwise variation (VJn+i) analysis 
between the normalization factors NF n and NF„+i 

Andersen and collaborators that elucidated the 
discrepancies caused by the differences between the 
approaches, due to the tendency of pairwise comparison 
to select genes with highest degree of similarity in their 
expression profile [18], foresaw the difference in rank 
obtained by NormFinder and geNorm for developmental 
stages. This is a problem when there are co-regulated 
genes between the candidates, they usually have a 
tendency to show very similar expression profiles and be 
top ranked, independently of their expression stability 
[18]. This can be the reason for which geNorm ranked 
efl -alpha and efl -beta genes as the best genes to be used 
as reference for different developmental stages. 

3.4.5 The comparative ACt method 
The ACt method compares pairs of genes, similarly to 
geNorm, and uses ACt to estimate the gene variability 
[31], Changes in gene variability were observed by the 
increase or decrease on the deviation of ACt among all 
possible combinations between candidate reference genes. 
The analysis of the most stable gene was done comparing 
the mean of the standard deviation of ACt. The lowest 
values correspond to lower variability for this gene. 


which establishes it as the most stable gene. For the 
developmental stages, rpll8, efl-alfa, and efl-beta genes 
showed the lowest and similar deviation (Table 3). The 
rank in decreasing order of stability for the developmental 
stages was rpl!8, efl -alpha, efl-beta , GAPDH and act. 
For the tissue samples the decreasing order of stability 
was rpll8, efl-beta , efl -alpha GAPDH and act. 

3.4.6 RefFinder 

The stability of the candidate reference genes was 
evaluated with four different algorithms (BestKeeper, 
geNorm, NormFinder and comparative ACt method). 
Differences in the mathematical model for each one result 
in different ranks for gene stability, but the methods are 
equally important [9, 17], RefFinder was used to propose 
a consensus list for ranking the reference gene for A. 
sexdens rubropilosa (Table 3), showing, for the 
developmental stages, in a stability decreasing order: 
rpll8, efl -alpha, efl-beta, GAPDH and act. For the 
tissues, the decreasing order of stability was rpll8, efl- 
beta, GAPDH, efl -alpha and act. 

3.4.7 Expression of the AChE from A. sexdens 
rubropilosa 

The RT-qPCR data for AChE normalized by each of the 
candidate reference genes are presented in Fig. 5a and 
Fig. 5b, for developmental stages and for tissues, 
respectively. As predicted, the results show a difference 
in quantification depending on the gene used to 
normalize. In worker the AChE expression level 
normalized with act and GAPDH was about 7. 3 -fold and 
about 4.2-fold lower than those normalized with rpll8 
and efl -alpha (P <0,0001). The great difference in tissues 
was in worker without gaster when the data were 
normalized with act , exhibiting an expression level 2-fold 
higher when compared with rpll8 (P=0,0012). However, 
the statistical analysis for the two top ranked genes by 
RefFinder for developmental stages ( rpll8 and efl -alpha) 
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and for tissues ( rp!18 and efl-beta ) showed no significant difference (P>0,05). 

Table. 3: Rank of reference genes in decrease order based on their expression stability according to BestKeeper, geNorm, 
NormFinder, comparative ACt method and RefFinder. The values were obtained after individual analysis of each software. 


Developmental stage 

Tissues 

BestKeepe 

r 

geNorm 

NormFinde 

r 

ACt 

method 

RefFinde 

r 

BestKeepe 

r 

geNorm 

NormFinde 

r 

ACt 

method 

RefFinde 

r 

GAPDH 

(0.50) 

efl- 
alpha / 
efl -beta 

(0.21) 

rpll 8 
(0.135) 

rpll 8 
(0.73) 

rpll 8 
(1.57) 

GAPDH 

(0.33) 

efl- 

beta/ 
rpll 8 
(0.16) 

rpll 8 
(0.028) 

rpll 8 
(0.30) 

rpll 8 
(1.32) 

rpll 8 
(0.58) 

rpll 8 
(0.26) 

efl -alpha 
(0.143) 

efl- 

alpha 

(0.74) 

efl -alpha 
(1.86) 

act 

(0.38) 

efl- 

alpha 

(0.20) 

efl-beta/ 

GAPDH 

(0.050) 

efl -beta 

(0.31) 

efl-beta 

(2.00) 

efl -alpha 
(0.59) 

GAPD 

H (0.71) 

efl -beta 

(0.176) 

efl -beta 

(0.79) 

efl-beta 

(2.45) 

rpll 8 
(0.41) 

GAPD 

H (0.27) 

efl -alpha 
(0.060) 

efl- 

alpha 

(0.34 

GAPDH 

(2.83) 

efl-beta 

(0.71) 

act 

(0.89) 

GAPDH 

(0.218) 

GAPD 

H ( 1 .02) 

GAPDH 

(2.83) 

efl -beta 

(0.44) 

act 

(0.37) 

act 

(0.140) 

GAPD 

H (0.39) 

efl -alpha 
(3.41) 

act 

(0.74) 


act 

(0.254) 

act 

(1.16) 

act 

(5.00) 

efl -alpha 
(0.49) 



act 

(0.51) 

act 

(3.98) 


The parameter for each software was standard deviation 
of the Ct (SD [±Ct]) for BestKeeper, expression stability 
value for NormFinder, M value after stepwise exclusion 
of the highest M value for geNorm, mean of standard 
deviation of ACt for ACt method and geomean of ranking 
values for RefFinder. 

The expression level for AChE in developmental stages, 
when the AChE data were normalized considering the 
two top ranked gene, enhance from larva to worker (Fig. 
5a). AChE from Anopheles gambiae also showed a 
similar expression pattern and some works have shown 


that this enzyme also exhibit a noncholinergic functions 
associated with insect development [43, 44, 45], In 
tissues, there is a small variation in the expression of 
AChE in the head compared to the mesosoma, while in 
worker without gaster, that is the junction of the other 
two, the expression is almost 2-fold higher than in 
mesosoma (Fig. 5b). In A. gambiae a higher AChE2 
expression was observed in abdomen than in head [45], 
Until now, none classification has been done for AChE 
from A. sexdens rubropilosa and we are working in 
another analysis for an accurate classification. 
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Fig. 5 .Relative quantification of AChE in A. sexdens rubropilosa. Expression profile of target gene normalized with different 
candidate reference genes in three A) developmental stages and B) tissues. Data are presented as mean+SEM of biological 

triplicate. 


IV. DISCUSSION 

RT-qPCR has been widely used for gene expression 
analysis due to the high accuracy, however, the reliability 
of the results are strictly correlated with the genes used as 
reference genes. Due to this, the validation of reference 
gene is necessary and there are several works suggesting 


the importance of the validation for each organism and 
experimental condition. This is the first work to validate 
reference gene for A. sexdens rubropilosa under biotic 
condition for different developmental stages and tissues. 
The results obtained here can support research in this 
field once leaf-cutting ants is considered as pests in 
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agricultural crop or reforestation areas mainly in South 
America. 

For the validation for reference gene was done statistical 
analysis of the data, beside this, different statistical 
algorithms such as BestKeeper, geNorm, NormFinder, the 
comparative ACt method and RefFinder were used to 
verify the stability of the genes selected. Once the 
reference gene can be regulated to some extent, a 
combination of reference genes should be used, and as 
indicated by geNorm two reference genes are enough to 
obtain accurate results. So, we suggest the use of rp!18 
and e/7 -alpha for developmental stages and rpll8 and 
efl-beta for tissues for genomic analysis in A. sexdens 
rubropilosa, based on consensus list provided by 
RefFinder. For S. invicta rpll8 and elongation factor 
(beta) were the most stable genes for expression in 
different developmental stages, castes and tissues [12], 
The similar results from these two studies was not 
obvious and experimental results were necessary, once 
the expression stabilities of HKGs were not conserved 
among evolutionarily close species [11, 14]. 

The expression stability values for candidate reference 
genes are higher for samples from developmental stage 
than tissue for all algorithm analyzed (Table 3) and this 
result can be associated with higher complexity of the 
sample [21], The transcript profiles from adult stage can 
change during eclosion process from pupa to adult, as 
predicted for S. invicta resulting in an increase of sample 
complexity [46]. Moreover, the fact that mature leaf- 
cutter ant colonies have one of the most complex 
polymorphic worker caste within ants can contribute to 
this pattern [24, 47]. 

Analysis using standard deviation (statistical analysis and 
BestKeeper algorithm) ranked GAPDH as one of the most 
stable genes for both development stages and different 
tissues. All other algorithms listed rpll8/efl -alpha 
(developmental stages) and rp!18/efl-beta (tissues) as the 
most stable ones. On the other hand, act was classified as 
the worst for almost all approach, act has not been ranked 
for other Hymenoptera [12, 14], but showed a controversy 
results for insects from Lepdoptera [11]. This result can 
be validate by the large number of genes involved in actin 
cytoskeleton organization identified in A. ceplialotes 
compared to other hymenopteran genes that are 
associated to the extensive cytoskeletal changes that 
occur during caste differentiation in Alta adults [24], 

In conclusion, we analyzed five candidate reference genes 
in two different samples from A. sexdens rubropilosa 
with different statistical approaches, a consensus list from 
stability of genes was obtained and the two top ranked 
gene were suggested as reference genes for this insect. 
The AChE expression pattern normalized with different 
candidate reference genes emphasize the importance of 
validation to obtain reliable and accurate results from 


Vol-2, Issue -2, Mar- Apr- 2017 
ISSN: 2456-1878 

gene expression analysis. Beside this, the expression 
analysis from AChE suggest that this enzyme is important 
in developmental stage growing from larva to worker and 
is spread on insect body. The results presented are 
essential to gene expression analysis in this leaf cutting 
ant associated with low genome information and the 
growing interest in pest management control 
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